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Abstract 

We consider a set up where a file of size M is stored in n distributed storage nodes, using an (n, k) minimum 
storage regenerating (MSR) code, i.e., a maximum distance separable (MDS) code that also allows efficient exact- 
repair of any failed node. The MDS property ensures that the original file can be reconstructed even if any n — k 
storage nodes fail. When a node fails, a new node collects data from the remaining n — 1 healthy nodes and repairs 
the failed node. The problem of interest in this paper is to minimize the repair bandwidth B for exact regeneration 
of the failed node, i.e., the minimum data to be downloaded by the new node to replace the failed node by its exact 
replica. Previous work has shown that with random network coding, a bandwidth of B = ^^^Z^^ is necessary 
and sufficient for functional (not exact) regeneration, i.e., if the repaired new node need not be exactly identical to 
the failed node, but only information equivalent to it. It has also been shown using interference alignment based 
techniques that if k < max(n/2,3) then, surprisingly, there is no extra cost of exact regeneration over functional 
regeneration and the same repair bandwidth of ^^^Z^^ suffices for exact regeneration. The practically relevant 
setting of low-redundancy, i.e., k/n > 1/2 remains open for A: > 3 and it has been shown that there is an extra 
bandwidth cost for exact repair over functional repair in this case. In this work, we adopt into the distributed storage 
context an asymptotically optimal interference alignment scheme previously proposed by Cadambe and Jafar for 
large wireless interference networks. With this scheme we solve the problem of repair bandwidth minimization 
for (n, /c) exact-MSR codes for all (n, /c) values including the previously open case of k > max(n/2,3). Our 
main result is that, for any (n, k), and sufficiently large file sizes, there is no extra cost of exact regeneration over 
functional regeneration in terms of the repair bandwidth per bit of regenerated data. More precisely, we show that 
limM^oo ^ = kin'-k) • result is analogous to the wireless interference channel setting where exact interference 
alignment through linear beamforming is seen to be infeasible for more than 3 users, but almost perfect alignment is 
achieved asymptotically by the Cadambe-Jafar scheme over a large number of signaling dimensions for any number 
of users. 



I. Introduction 

The problem of interest in this paper is to minimize the bandwidth required to exactly repair failed nodes in 
distributed storage systems. It is well known that maximum distance separable (MDS) codes can be used to reliably 
store data in distributed storage nodes. To see this, consider a scenario where a file of size M is to be stored in n 
distributed storage nodes. The file is split into k equal parts of size M/k and stored in the first k storage nodes, 
also known as systematic nodes. The remaining (n — k) nodes, known as parity nodes or non-systematic nodes, 
store data of the same size, i.e., M/k, adding redundancy to protect from failure of storage nodes. The parity 
nodes are designed so that a failure of up to (n — k) storage nodes can be tolerated, i.e., the original file can be 
completely recovered from the data stored at any k nodes out of the original n nodes. Clearly, for this problem, 
storing the data using an (n, k) MDS code suffices to achieve the required reconstruction criterion, since an MDS 
code protects the data from (n — k) erasures. Now, consider the case where only 1 node fails, and a new node is 
introduced to replace the failed node. The total amount of data to be downloaded by the new node to regenerate a 
single failed node will be henceforth referred to as the repair bandwidth. Clearly, a repair bandwidth of M suffices 
to repair a failed node since the new node can download data of total size M from any k of the remaining n — 1 
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Fig. 1. Pictorial Representation of Problem Definition for n = 4, /c = 2 
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healthy nodes to reconstruct the failed node. However, note the inherent inefficiency in the solution - to reconstruct 
a node of size M/k, the newcomer downloads data of size M, i.e., k times the size of the data to be repaired. A 
question of interest is whether this inefficiency is fundamental, or whether the node can be repaired with the new 
comer downloading data of size less than M. More specifically, the question of interest of this paper is what is 
the minimum repair bandwidth required to repair a failed node? The question of minimum repair bandwidth has 
been studied previously from two perspectives f\\, f2], |3|, [4J, L5J. The first is called functional regeneration LU, 
121 and the second is called exact (or systematic) regeneration O, lH, ||5l. 

The functional regeneration problem requires the new node to replace the failed node by a function of the data, 
so that the reconstructed new node, along with the other nodes satisfy the property of being an (n, k) MDS code. 
In other words, the repaired node is information equivalent to the originally stored data. Note that in the functional 
regeneration problem, the data stored by the repaired node need not be identical to the data stored by the failed 
node; all that is required is that the repaired node along with the other nodes forms a MDS code. This problem 
has been shown to be equivalent to finding the capacity of a particular wired single-source multi-cast network. 
Since network coding achieves the cut-set bound in a single-source multi-cast network, the functional regeneration 
problem has been solved, and it is shown in [1] that the minimum bandwidth required is S = ^{n-k) ' ^^^^ ^^^^ 
since ^I^^I^^ is smaller than M (for k < n), the solution trivially implies that reconstruction of a single failed 

node requires a smaller repair bandwidth than reconstruction of (n — k) failed nodes, by a factor of ^^^z^. 

The focus of this paper is on the exact (or systematic) regeneration problem, where the new comer is required 
to replace the failed node by a replica, i.e., an identical copy of the failed node. Since exact regeneration ensures 
that a failed systematic node is replaced by a systematic node, the systematic structure of data storage is retained. 
There is a practical advantage of preservation of the systematic structure which ensures easy access to the data 
for a client, since the client can simply download it from the k systematic nodes without any decoding. Note that 
the constraints for systematic or exact regeneration are stricter than the functional regeneration problem. Since any 
solution for the exact regeneration problem is also a solution to functional regeneration problem, ^j^^Zl) serves as 



a lower bound to the minimum repair bandwidth for the exact repair problem. However, if the repair bandwidth of 

M(n-l) 
k{n—k) 



suffices, in general, has been an open question. It is this open question that is the focus of this paper. 



A. Related Work and Summary of Contributions 

The exact regeneration problem was formulated and solved for the special case of n = 4, A: = 2 in ||3]|. The 
solution was further extended to the more general case ofk < max(3, n/2) in ||5l. The results in all these cases 
yield the same surprising conclusion: there is no price for exact regeneration over functional regeneration, and a 
repair bandwidth of ^(^^Z^-^ suffices even for exact regeneration. The solution for these cases stems from drawing 
parallels between the exact regeneration problem and the wireless interference channel [4J. Such parallels enable the 
use of the interference management technique of interference alignment ||6]|, Q for the exact regeneration problem. 
However, prior to this work, as far as we are aware, little was known about the minimum repair bandwidth for 



k > max(3, |). From a practical perspective, note that the previously unsolved case of k > max(3, |) is important 
because this case corresponds to the amount of parity data (i.e., number of parity nodes) being smaller than the 
original file size (number of systematic nodes). This case is briefly studied in reference |5| where for /c > f + 1, 
it is shown that the lower bound of ^^^Z^^ cannot be achieved using linear codes. The main contribution of this 
paper is to make progress in this open problem drawing inspiration from the interference alignment solution for 
the K user wireless interference channel in |81. We argue that, while that lower bound on the repair bandwidth of 
^(n-k) sufficient in general as noted in [5 |, the repair bandwidth per bit of repaired data can indeed 

achieve this lower bound in the limit of large file sizes for any k < n. More precisely, we show that 

B n-1 
lim — - = 



M^oo M k{n - k) ' 

i.e., for any k < n, with sufficiently large amount of data, there is no cost of exact repair over functional repair in 
terms of repair bandwidth per bit of repaired data. An interesting insight of our solution is that the size of the symbol 
extension in wireless interference channels is analogous to the file size of our solution. Reference 1 8 1 shows that the 
optimal number of degrees of freedom of the interference channel cannot be achieved with finite symbol extensions 
(using linear schemes), but can only be achieved asymptotically in the limiting case of arbitrarily large symbol 
extensions. This is analogous to our result combined with that of reference |5|. The reference shows that the bound 
of ^{n-k) ^^^^^^ be achieved exactly. Here, we complete the analogy between file size and symbol extensions in 
our main result by showing that while the bound is not exactly achievable, it is achievable asymptotically in the 
limit of large file sizes. We state our main result formally below. 

Theorem 1: Consider any tuple (n, k) such that n > k. For a file of size M stored in n distributed storage nodes 
as a part of a (n, k) MDS code, the minimum repair bandwidth B for exact regeneration of a (single) failed node 
satisfies 

B n-1 
lim — = . 

M^oo M k{n - k) 

Equivalently, we can write 

Our approach to the problem is different from those of ||3, which use Cauchy matrices for code construction for 
k < max(3, n/2). Before we proceed to prove the theorem, we take a closer look at a simple case of (n = 4, = 2), 
initially presented in |3J, to describe the role of interference alignment in exact regeneration. 



IL The Role of Interference Alignment in Exact Regeneration : n = 4, A: = 2 

Consider the case where the n = 4, A: = 2. Further, let the file to be stored be [xi X2]^ where x^, z = 1, 2 are 
2x1 vectors over a finite field of size q denoted by ¥q. 

Remark 1: Note that for sufficiently large file sizes, the field size g is a design parameter. For the solution for 
n = 4, A: = 2 presented here, any prime q >3 suffices. Also by defining xi, X2 to be 2 x 1 vectors, we are assuming 
that the file-size M = 4, i.e., 4 scalars over the field. For large file sizes, M can be treated as a design parameter, 
since a code for a specified M can be used for larger files by splitting the file into portions of size M. 

As in Figure [2} the first systematic node stores xi and the second systematic node stores X2, Let the parity nodes 
m = 3,4 store vectors of the form A^^ixi + A^ 2X2, where A^^^,i = 1, 2, m = 3, 4 are 2 x 2 matrices. Now, 
consider the case where A3^2 = ^4,2 = I and 
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Note that xi, X2 can be reconstructed from any A; = 2 of the n = 4 nodes. Now, consider the case where the first 
node fails. First, we present a naive solution which does not align interference. Note that the contents of the node, 
i.e. xi can be reconstructed by a new comer downloading 4 linear combinations (or equations) from any other 
two nodes. For example, it can be reconstructed by downloading the 2 x 1 vector X2 from node 2 and the 2 x 1 
vector Ai^ixi + Ai^2X2 from node 3. In this case, note that among the 4 dimensions (corresponding to the 4 linear 
equations) at the new comer, two dimensions are occupied by the data to be reconstructed, i.e., the desired data xi 
and two dimensions are occupied by the undesired data or interference X2. 
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Fig. 2. Alignment based Exact Repair for n = 4, /c = 2 f3\ 



However, a more efficient solution exists. By aligning the interference into 1 dimension, we can see that the 
node can be repaired by downloading only 3 linear combinations of the stored data from the remaining healthy 
nodes. To see this, consider the case where the new comer downloads a total of 3 linear combinations of the stored 
data, one from each remaining healthy node, as follows. 

• v|^X2 from node 2 

• Vg" (Aa^ixi + A3^2X2) from node 2 

• vj (A4^ixi + A4^2X2) from node 3 

where V2 = V3 = V4 = [1 1]^ (See Figure [2]). As shown in Figure [2j since 

V2 Ai^2V3 Ai^3V4 

the interference aligns into 1 dimension at the new comer. Further, since 

rank([A^]^V3 A4 ]^V4]) = dimension of xi = 2, 

and [Ag^^va A4 ^V4] is linearly independent of the aligned interference V2, the data storage node can reconstruct 
the desired two dimensional vector xi from the received 3 dimensional vector. The code can be shown to exactly 
repair any failed node with a repair bandwidth of 3, i.e., with the new comer collecting 3 linear combinations from 
the healthy nodes. We now proceed to extend this for general values of (n, k) and prove our main result. 

III. Proof OF Theorem [H 

We begin by generalizing the setting described in the previous section. The total data is represented by the 
M/k X k dimensional matrix [xi X2 ... x/^], where x^ is an M/k x 1 dimensional vector stored by systematic 
node i G {1,2,..., k}. Node j, where j G {/c + 1, /c + 2, . . . , n} being a parity node stores the M/k x 1 vector 
Aj^ixi + Aj^2X2 + . . . + Aj^/eX/e, whcrc Aj^i is a M/k x M/k square matrix for z G {1, 2, . . . , k}. Henceforth, we 
assume that for j < k, 

Ai,i = | 5 ^jtl ,Vie{l,2,...,fc}. 

The above assumption implies that the data stored in node j G {1, 2, . . . , n} is the M/k x 1 vector 

k 

1=1 



Note that Aj^i for j = A: + 1, A; + 2, . . . , n are a design choice that define the code; these matrices will henceforth 
be referred to as the coding matrices. We need to choose these matrices so that the code is an MDS code, i.e., 
using any subset of k nodes, the entire M x 1 vector of data must be reconstructable. Thus, we need to ensure that 
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(1) 



for any distinct ji, j2, . . . , j/c G {1, 2, . . . , n}. 

Now, when a node fails, the new comer collects a /3 



1 vector from each of the remaining (n — 1) healthy 



nodes where /3 = so that the total repair bandwidth is B. Our goal is to find the coding matrices Aj^^, (j, i) G 
{/c + 1, A: + 2, . . . , n} x {1, 2, . . . , A:} and design the /3 x 1 vector to be downloaded by the new comer so as to 
meet the required bound (presented in the statement of the theorem). We now describe our solution assuming that 
a systematic node fails. We will later describe how the solution can be adapted to repair failures of parity nodes. 
Without loss of generality, let us assume that node 1 fails. We provide a linear solution to this problem, so that the 
/3 X 1 vector downloaded by the new comer from node j > 1 to repair node 1 is VjDj, where Vj is a M/k x /3 
matrix. The matrices Vj will be henceforth referred to as the repair vectors. The new comer now has to regenerate 
the M/k X 1 vector xi using (n — 1) vectors of the form VjDj, j = 2, 3, . . . , n — 1, each of dimension /3 x 1. 
Notice that the (A: — 1) vectors (of dimension /3 x 1) downloaded using the (A: — 1) systematic nodes do not contain 
any information about the desired vector xi and can be interpreted as interference. Therefore, the new comer has, 
apart from the interference, (n — A:) vectors of dimension /3 x 1 containing linear combinations of the desired data. 
Thus, the vectors available at the new comer can be described as follows. 

• (A: — 1) vectors of the form Vjxj, 1 < j < k - these vectors are downloaded from the (A: — 1) healthy 
systematic nodes. They contain no information about the desired data, and will be used to cancel interference. 

• {n — k) vectors of the Vj ^^^^ Aj^^x^, A: < j < n - these vectors contain both the desired signal and 
components of the interference. 

The goal of our solution will be to completely cancel the interference from the latter (n — A:) vectors using the 
former (A: — 1) vectors listed above, and then to regenerate xi using the latter (n — k) vectors. In order to completely 
cancel the interference related to x^ using Vf x^ by linear techniques, we will need, Vj = A: + 1, A: + 2, . . . , n, and 
for some /3 x /3 matrix Aj, 
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where ([3]) follows from the fact that the matrices Aj^i and are picked independent of the data x^, and therefore 
need to satisfy (|2]) for any data vector x^. 

While the above condition ensures that the entire interference can be cancelled, we also need to ensure that, on 
interference cancellation, the (n — A:) vectors of dimension /3 x 1 are sufficient to reconstruct xi. Note that after 
interference cancellation, each of the (n — A:) vectors is of the form VjAj^iXi, for j = A: + 1, A: + 2, . . . , n. For 
linear reconstruction, we need 

^^+2^^+2,1X1 
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for some M/k x {n — k)(3 matrix Therefore, we need 

colspan([A^+iiVfc+i A^+2,iVfc+2 ... A^^^V^]) = y (5) 

Therefore, our goal is to design Aj^i and V/ for j G {/c + 1, /c + 2, . . . , n}, z E {1, 2, . . . , A:}, / = 2, 3, . . . , n so 
that 

• The code is a (n, A:) MDS code. 

• The interference is aligned appropriately so that it can be completely cancelled. 

• The desired signal xi can be regenerated at the new comer. 

Thus, essentially we need to pick Aj^i and for j G {/c + 1, /c + 2, . . . , n}, z G {1, 2, . . . , A;}, / G {2, 3, . . . , n} so 
that ([T]), (|4]) and (|5]) are satisfied. Further, as noted in Remark 1, the field size q and M are also design choices 
(for large file sizes) that we can use to satisfy these conditions. 

A. The solution : Choosing Aj^,^, V/, M and q 

For k < max(3,n/2), the solutions of 1(41, ||5l design these matrices using Cauchy matrices to satisfy these 
conditions. Here, note that the conditions Q, (|5]) are similar to the interference alignment conditions in the 
interference channel Specifically, (|4]) is analogous to the condition that all the interference must align in the 
K-usQY interference channel, and (|5]) is similar to the condition that the desired signal must be linearly independent 
for linear decoding in the interference channel |8|. These parallels with enable us to build a solution based on the 
asymptotically perfect interference alignment scheme of the same reference. 

On noting that there are F = (n — — 1) alignment equations in Q, like in |[8l, we choose M = k{n — k)A^ 
and /3 = (A + 1)^, where A > 1 can be any integei[^ For any value of A, we show the existence of a field size q, 
matrices Aj^^, i G {1, 2, . . . , A:}, j G {A: + 1, A: + 2, . . . , n} and V/, / G {2, 3, . . . , n} so that ([T]), Q, ^ are satisfied 
and the failed node can be repaired. Finally, we show that our code can be used to repair non-systematic nodes 
as well. Before we proceed to give a random coding based construction of the coding matrices and repair vectors, 
we will evaluate the repair bandwidth achieved by our scheme. Noting that our construction is applicable for any 
value of A, we can make A, a design parameter, arbitrarily large. As A ^ oc, we have M ^ oo and 



M^cx) M A^cx) k{n - k)A^ k{n - k) 

We now proceed to explain our construction of coding matrices and repair vectors satisfying the constraints 
of repair ([T]), (|4]), ([5]). Our solution, unlike those in references El, ||5l, is a random coding solution. Specifically, 
we choose the coding matrices Aj^^, i G {1, 2, . . . , A:}, j G {A: + 1, A: + 2, . . . , n} randomly. We then provide an 
expression for V/, / G {2, 3, . . . , n} as a (random) function of Aj^i so that ^ is satisfied. Then we show for large 
field size q, that ([T]) and ([5]) are satisfied with a non-zero probability. This implies that there exists at least one 
choice of coding matrices Aj^i so that all the desired condtions, i.e., ([!]), (|4]), ([5]) are satisfied. 

Design of Coding Matrices, Aj^i: The alignment constraints, Q, are similar to the alignment constraints for 
the interference channel (See equation (50) in [SJ). Note that the matrices Aj^i play a role analogous to channel 
matrices in wireless interference channels [[8l. Drawing inspiration from |8|, we choose the M/kx M/k dimensional 
matrices Aj^iij = A: + l,A: + 2,...,ntobe random diagonal matrices with each diagonal entry of each matrix 
chosen independently and uniformly distributed over the non-zero elements of the field Fq. In other words, we 
choose 



a]^ ... 
d ... 



... aA 



(6) 



with all the diagonal entries chosen independent of each other and independent of all the diagonal entries of all 
other coding matrices, i.e., with chosen independent of a^-. from the non-zero elements of the field, for all 

i ^ i or j ^ j or m ^ rh, where z, z G {1, 2, . . . , A:}, j, j G {A: + 1, A: + 2, . . . , n} and m, m G {1, 2, . . . , ^}. Note 
that all the coding matrices are full rank since all the diagonal elements are non-zero. We later show that that this 
code is an MDS code with non-zero probability. 



^The intuition for these choices of M and /3 will hopefully become clear later in this section for a reader unfamiliar with (8). 



Design of Repair Vectors, V^; Here, we provide a set of repair vectors that satisfy (|4]). We first set the columns 
of vectors (which are analogous to beamforming vectors in interference channels) 

V2 = v. = . . . = V. = v' 



k+1 



k+2 



where, V and V' are M/k x /3 dimensional matrices. Then the relations Q can be re-written as 

colspan(Aj^iV) C colspan(V ), z = 2, 3, . . . , A: (7) 

for j = + 1, + 2, . . . , n. Note that there are {k — l){n — k) = T conditions contained in We wish to find 
V, V' so that all these conditions are satisfied. 

Intuitive understanding of asymptotic alignment: Before we provide precise expressions for V,V', we will 
intuitively explain the extent of alignment required to to satisfy (|4]), ([5]). Since our bandwidth is restricted by /3, we 
need rank(V) < /3 = (A + 1)^ and rank(V') < (A + 1)^. Further, noting that Q implies Ej=fc+i rank(V^) > 
we get rank(V) > ^^j^^^ = A^. Therefore V must have at least A^ non-zero linearly independent columns. In 
order to satisfy dTj), the span of the FA^ non-zero column vectors of the matrix 



A^.V] 



1, 



[A/e+i,iV A;e+2,lV ... A^,iV Aa;+i,2V Afc+2,2V 

should align in the space spanned by the (A + 1)^ column vectors of V'. For large values of A, since 

and all the coding matrices have a full rank of M/k, we have ^^^^]^y'^^ ~^ ^ ^ ^ {A: + 1, . . . , n}, i G 

{1,2, . . . , A:}. From ^ this implies that colspan(Aj^iV) ^ colspan(V'). In other words, the alignment between 
the F matrices on the left hand side of the F relations indicated by ^ is asymptotically perfect for large A. Next 
we return to the mathematical construction of the alignment scheme. 

Following the arguments of |[8]|, ||9l, we choose the set of non-zero column vectors of V, V' as shown belowj^ 

r / \ 
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j=k+l,...,n 
2,...,fc 



w : a/c+1,2, •-, "n,/c e {0, 1, A - 1} 
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J=fc + 1,...,? 



w : 0:^+1,2, o^n.k ^ {0, 1, 2, A} 



(9) 



J 



where the entries of the M/k x 1 column vector w are chosen uniformly over the non-zero elements of the field 
and independent of all the coding matrices. 

Thus, the elements of V contain products of (diagonal) coding matrices corresponding to interference symbols 
contained in the parity nodes, with each matrix raised to an exponent that is allowed to take integer values from 
upto A — 1. Since there are F = {k — l){n — k) coding matrices and A possible distinct values for the exponent of 
each matrix, the total number of elements, i.e. column vectors, in V is A^. Similarly, the total number of column 
vectors in V' is (A + 1)^. To understand the notation better, consider, e.g., the case where A = 1. Then, V = w, 
i.e., just one column vector, and V' contains all the 2^ vectors of the form 



^k+l,n ^k+2,2 



where aj^i G {0, 1}. For any general value of A, the columns of V are of the form 



Oik + 1,2 A Oik + 1,3 



...A 



C^k + 2,2 



...A 



n,k 



W 



where aj^i G {0, 1, . . . , A — 1} and V' has columns of the form 



A (^k + 1,2 A Q;fc + 1,3 



A Cyk + l,n A ak + 2,2 

^k+l,n ^/c+2,2 



^For convenience, we ignore the abuse in notation of these equations; the quantity on the left denotes the matrix, whereas the quantity on 
the right only denotes the set of non-zero columns of the matrix. 



where aj^i G {0,1,..., A}. Note that the ordering of the matrices Aj^i in the above notation is irrelevant, since the 
coding matrices, being diagonal, commute. This commuting property is the key to the alignment scheme. Because 
the ordering of matrices is irrelevant, it is readily verified that multiplying any column vector from V by any of the 
Aj^i involved, produces a column vector contained in V' . This is because multiplication by Aj^i simply raises the 
corresponding exponent of the element in V by one, but the elements of V' already include all such terms. Since 
the set of columns of Aj^^V is a sub-set of the columns of V' for any j G {A: + 1, /c + 2, . . . , n}, z G {2, 3, . . . , k}, 
it is evident that this choice of repair vectors satisfies (|7]), and equivalently, (|4]). 

Proof of Q, (|5]).- We have now chosen coding matrices and repair vectors so that the alignment constraints ^ 
are satisfied. We now need to show ([T]) and (|5]). In order to show that the matrices of ([T]) and ^ are full rank, it 
is enough to show that their determinants are non-zero. Notice that the determinant of the matrix of ([T]), i.e., 

Aj,,i Aj^^2 ... Aj^^k 

^J2,l ^J2,2 . . . Aj2,fc 



Aj,,i Aj^^2 ... Aj^^k 



(10) 



is a polynomial in its entries. Note that there are (^) polynomials of this kind, which can be represented, for 
/ = l,2,...,©,as 

where 

^={<^:i^{fc + l,fc + 2,...,n},zG{l,2,...,fc},mG{l,2,...,(n-fc)Ar}}^F„ 

denotes all the diagonal entries of the coding matrices. In the appendix, we show that each of these polynomials 
is a non-zero polynomial. 

Similarly, we need to show ([5]), i.e., 

M 

colspan([A;e+i,iV Afc+2,iV . . . A^,iV]) = — 

where V has = k{rf-k) i^^n-zero columns chosen using Using these non-zero columns (i.e., discounting 
the columns of V which are zero) the above matrix is of dimension x ^ X- Therefore, to show that this square 
matrix has a full rank of we need to show that its determinant is non-zero. Since a determinant is a polynomial 
function of its entries, the determinant expansion above is a polynomial 

g : ^U{'?/;i,^2,...,^(n-/c)Ar} ^F^, 

where ^ ^ [wi W2 ... ^{n-k)/^A^ - An argument very similar to Lemma 1 of [H^ can be used to show that the 
polynomial formed by this matrix for our solution is a non-zero polynomial (See also Appendix III in |8|). Thus, 
the product /i(.)/2(.).../(^-^ (.)^(.) is non-zero polynomial of A^J {wi^W2^ • • • , ^(n-itjAr}. Using Schwartz-Zippel 
Lemma, for large enough q, we have at least one choice of coding matrices and repair vectors such that these 
polynomials do not evaluate to non-zero, and therefore a solution exists so that ([l]),(|5]) are satisfied. 

Repair of Non-Systematic (Parity) nodes: So far, we have discussed an achievable scheme for regenerating a 
systematic node. The code Aj^i constructed here can also be used to regenerate a failed parity node in the same 
manner. To see this, suppose that a parity node, say node /c + 1, fails. The new comer intends to regenerate D^^+i. 
Let 

k 

i=l 

Since the code is an MDS code, using a change of basis, we can write 

k 

xi = Ai + ^ Ai^^Xi 

1=2 
k 



n 



where A'j ■ are all diagonal. In other words, a change of basis can essentially transform the regeneration of a 
parity node to appear like regeneration of a systematic node, i.e., with nodes 2, 3, . . . , A: + 1 viewed as systematic 
nodes storing data X2, X3, . . . , x/^, x'^; nodes l,A: + 2,A: + 3,...,nare viewed as parity nodes using coding matrices 
A'j-^j ^ {2, 3, . . . , A; + 1}. Since all the coding matrices are diagonal, the problem can be solved in a similar 
manner as above, i.e, the vectors downloaded by the new comer can be constructed as in ([9]), ([8]) and be verified to 
satisfy a property similar to (|4]). The only thing that remains is to verify if a reconstruction criterion similar to ([5]) 
is satisfied. In order to show this, it is enough to show that all the (random) diagonal entries of any new coding 
matrix Aj- for j ^ {2, 3, . . . , A: + 1} are uniformly distributed over the non-zero entries of the field which are 
independent of each other and independent of all diagonal entries in the other new coding matrices, much like our 
original construction. Showing this independence property will ensure that our earlier proof of ^ is applicable. In 
order to show this independence property, we explicitly evaluate A'j ^ for j ^ {2, 3, . . . , A: + 1} as follows. 

j = 1,^ = 1 

-^kli,Ak+i,t j = 1, z G {2, 3, . . . , A:} 

Aj^.A'l^^^ jG{A: + 2,...,n},z = l 

G{A: + 2,...,n},zG{2,3,...,A:} 

Now, note that if Si and S2 are two diagonal matrices with their diagonal entries drawn independently and uniformly 
distributed over the non-zero elements of the field, then each of the matrices SiS2,Si + S2,— S2 and S^^ has 
diagonal entries uniformly distributed over the non-zero elements of the field. Further each of these matrices 
are independent of Si. This implies that all the diagonal entries of A'j-^j ^ {2, 3, . . . , A: + 1} are distributed 
independently of each other, and uniformly distributed over the non-zero elements of Fq. Also, this property can 
be used in [lu to verify that all the entries of any coding matrix A'j- are independent of the entries of all the 
entries of any other new coding matrix A~~ for i ^ i or j ^ j. For example, A^ ^ is independently distributed 

of A^ 2 since the entries of A/e+1,2 are independent of A/e+1,1 in our original code construction. Thus, the basis 
transformation preserves the required independence criteria and a property similar to (|5]) holds. This completes the 
proof. 

IV. Conclusion 

We have shown that, per bit of data to be reconstructed, surprisingly, there is no loss of exact regeneration over 
functional regeneration in terms of the amount of repair bandwidth per bit of repaired data, in the limit of large 
file sizes, regardless of the desired redundancy level. The result is in contrast with previous work in |5| where 
it is shown that there is an efficiency loss for exact regeneration over functional regeneration especially for low 
redundancy levels. However, note that the two results do not contradict each other. While our asymptotic alignment 
scheme can approach arbitrarily close to the cut-set bound on minimum repair bandwidth per bit of repaired data, 
the bound is not achieved with exact equality. Also unlike previous work in Hi, we do not provide explicit codes 
or specify the minimum field size, since our arguments are based on properties of random matrices. Directions for 
ongoing work include interference alignment solutions for exact repair for each point on the storage-bandwidth 
tradeoff curve. 

Appendix 

We intend to show that the determinant of the matrix in ([T]) is a non-zero polynomial in its entries. Assuming, 
without loss of generality, that ji, j2, ■ ■ ■ ^jk are in ascending order, let ji, j2, • • • ^jk-m ^ {I5 2, . . . , A:} and 
jk-m+iijk-m+2i • • • ijk ^ {A: + 1 , A: + 2, . . . , n}. Therefore, we need to show that the determinant of the following 




matrix is a non-zero polynomial of its entries. 





Ai,2 


M,k 


A o 1 

-^2,1 


A r> r> 

-^2,2 


A 7 


A/e— m,l 


A/c-m,2 • 


■^k—m^k 


A/e+1,1 


Afc+1,2 . 


• Ay^+i^yt 


A/c+2,1 


A/c+2,2 • 


• Afc+2,/c 


A/c-(- 777,^1 


A/c+rn,2 • 


A/g_|_7jj,^/j; 



Since 



Aj,i — 



j^i 

1 j = i 



,ViG{l,2,...,fc} 



we want the following matrix to be full rank. 



I M ^ M 

k 

M w M 
M ^ M 

A/c+1,1 



M - . M 

M - , M 
k^ k 

1 M ^ M 

^/cH-l,/c— m 



^k-\-m,k—m 



M - . M 

M ^ M 
M ^ M 
A/c+l,/c 



(12) 



Therefore, we essentially need to show that the determinant formed by the above matrix is non-zero. Since the first 



{k 



m 



M 



{k 



m 



M 



matrix is the identity matrix, expanding the determinant along the first {k 



the determinant can be shown to be equal to the determinant of the following matrix. 



k 



P = 



^/e+l,(/c-m) + l 
^/c+2,(/c-m) + l 



*-k-\-m,(k—m)-\-l 



^/c+l,(/c-m)+2 
^/c+2,(/c-m)+2 



^/c+m,(/c-m)+2 



A/c+i,fc 
A/c+2,/c 

■^k-\-m,k 



(13) 



We need to show that the determinant of the x matrix P is non-zero. Note that we have 



A ■ • — 



3,^ 


















3,^ 



(14) 



where each a^- is independent of aK, ., for i ^ i' or j ^ j' or I ^ l' . Since interchanging the rows or columns of 
a matrix does not change its determinant except for its sign, we make the row and column exchange operations 
to simplify P. Let the rows of P be ri, r2, . . . , YmM/k- Now, we only need to show that the determinant of P' is 
non-zero where 

^Ti(i) 

p' ^ ^-1(2) 

_ ^Wi(mM/k) 

where vri : {1,2,..., mM/k} — ^ {1,2,..., mM/k} is a permutation. Now, further, let ci, C2, . . . , CmM/k t>e the 

columnsof P. Wethen perform column exchange operationsofP toget the matrixP" = [c^2(i) ^tt2{2) ■■■ ^■K^imM/k)]^ 
where, 7r2 : {1, 2, . . . , mM/k} — )■ {1, 2, . . . , mM/k} is also a permutation. Now, that the determinant of P is non- 
zero is equivalent to showing that the determinant of P is non-zero. Choosing the permutations 7ri,7r2 as 



7ri(i) = 7r2(i) 



1 + 



i - 1 
_M/k 



+ 



i - 1 
_M/k 



M 
~k 



1 TO 



it can be verified that the x matrix P" has a block diagonal structure, with ^ blocks of size m x m. 
The zth block of P is . . 

^A:+l,(/c-m)+l ^fc+l,(/c-m)+2 ' ' ' ^k+l,k 
^/c+2,(/c-m)+l ^/e+2,(/c-m)+2 %+2,fc 

- ^k+m,{k-m)+l ^k+m,{k-m)+2 ' ' ' ^k+m,k _ 

Since the determinant of a block diagonal matrix is a product of the determinant of each of its blocks, and the 
determinant of the square matrix formed by the above block is a non-zero polynomial of its entries, the determinant 
of the matrix in ([T]) is a non-zero polynomial of its entries, as required. 
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